Approximation of real errors by Clifford channels and Pauli measurements 
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The Gottesman-Knill theorem allows for the efficient simulation of stabilizer-based quantum error-correction 
circuits. Errors in these circuits are commonly modeled as depolarizing channels by using Monte Carlo methods 
to insert Pauli gates randomly throughout the circuit. Although convenient, these channels are poor approxi- 
mations of common, realistic channels like amplitude damping. Here we analyze a larger set of efficiently 
simulable error channels by allowing the random insertion of any one-qubit gate or measurement that can be 
efficiently simulated within the stabilizer formalism. Our new error channels are shown to be a viable method 
for accurately approximating real error channels. 



I. INTRODUCTION 



Quantum computation requires the mitigation of errors that 
occur due to faulty controls and unwanted interactions with 
the environment 01 0]. Fault-tolerant quantum error correc- 
tion is one method for mitigating these errors with the advan- 
tage that provable arbitrary quantum computation is possible 
given constraints on the error rates and the error locality JH-0]. 

There are many possible error correcting codes J3, 0-EH] 
and the mapping of abstract models involving qubits on a 
completely connected graph to a more realistic local archi- 
tecture leads to a number of choices that makes analytical 
comparison of codes difficult. In these systems it is typical 
to use simulation to determine the error correcting properties 
1 16- Jjjl] • Although simulation of quantum systems is difficult 
IU9ll20ll . simulation of error correction can be done efficiently 
for stabilizer codes where the process of error correction only 
includes gates in the Clifford group I2lll22ll . 



A standard error model is a depolarizing channel where a 
Pauli operator from a chosen probability distribution is ap- 
plied at every possible error position [f^L EH EH HH. The 
depolarizing channel efficiently simulates common laboratory 
processes such as dephasing. It also serves as a good approx- 
imation for most error process that lead to a steady-state in 
which the system becomes maximally mixed. These are uni- 
tal channels that map completely mixed states to completely 
mixed states. 

In nature it is also common to encounter interactions with 
the environment that lead to non-unital error channels in 
which the maximally mixed states are not a fixed point of 
the error process. One example is amplitude damping where, 
given enough time, all density matrices map to a single pure 
state. If an error channel is far from unital, then simulating it 
with Pauli errors gives large approximation errors making it 
hard to extract useful results. 
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In this paper, we go beyond simulating errors with the con- 
ventional Pauli depolarizing channel (PC). Instead of only re- 
stricting to Pauli errors, we allow any subset of efficiently 
simulable gate errors to occur. In particular, we look at sub- 
sets generated by including all Clifford group operators and/or 
Pauli measurements to the PC channel. We show that adding 
Clifford errors and/or measurement errors always results in 
more accurate approximations and results in significant im- 
provements for most error channels. We consider an approxi- 
mating error channel to be valid if it has a smaller fidelity than 
the target error channel and choose the best valid approxima- 
tion by minimizing the Hilbert-Schmidt metric. 

The paper is organized as follows. In SectionHIlwe first de- 
scribe the simulable error processes and introduce our expan- 
sions to the PC. We then mention two important error chan- 
nels that cannot be simulated in the stabilizer formalism and 
finally we describe our method for approximating a general er- 
ror channel with our new models. In Section Hill we compare 
how well these models approximate the two error channels 
mentioned in Section [Tl] and also a collection of random er- 
ror channels. In Section |IV] we conclude and describe future 
research directions. 

Throughout the paper we use ci = X, er 2 = Y, and 
(73 = Z to represent the Pauli matrices with associated eigen- 
vectors { I +) , I -) }, { I + i) , I - i) }, and { |0) , 1 1) } respectively. 



II. ERROR CHANNELS 

It is convenient to consider the interaction of the environ- 
ment with the system for a fixed time. Then the system dy- 
namics can be represented by a set of time-independent Rraus 
operators that form an error channel. 

We begin by considering all error channels that can be sim- 
ulated efficiently within the stabilizer formalism. Next we ex- 
amine two specific error channels that are outside of the sta- 
bilizer formalism. Finally, we discuss a method by which we 
create an error channel that approximates a target channel. 
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A. Efficiently simulable error processes 

The stabilizer formalism allows for efficient simulation on 
a classical computer of operators from the Clifford group op- 
erating on states stabilized by Pauli operators [22]. The Clif- 
ford group for n-qubits can be generated from CNOTs and the 
1-qubit Clifford gates. As error channels, the probabilistic ap- 
plication of 1-qubit Clifford operators can be represented by 
the following Kraus operators: 



TABLE I. Kraus operators corresponding to the 4 efficiently simula- 
ble error channels. 



Channel Label 


Kraus error set 


PC 


{Ei} 


PMC 


{Ei,E lf){f \,E lf)(f ± l } 


CC 


{Ei , Es, ±j ,Ej t ± k ,Eg,} 


CMC 


{Ei, Es,±j,E j>±k , Ep, -E|/)</| , E ]f){f±l } 



• Identity 

E = 

• Pauli operators 

Ei = yfpl<Ji 

• S-like operators 

Es,±j = y/ps&exp(-ij{±a-j)) 

• Hadamard-like operators 

Ej,±k = y/Pj^kk exp(-i§ ^ {a 3 ± a k )) for k > j 

• Rotations about the face centers 

Ep = y/pp exp(—i^<Tpt) , where op = F ■ a and F is 
the unit vector from the origin to one of the eight faces 
of the 1-qubit Clifford octahedron JH], 



The stabilizer formalism also includes non-unital operators. 
The simplest are measurement operations in the Pauli basis. 
More intricate non-unital Kraus operations can be represented 
as measurements followed by gates conditioned on the mea- 
surement outcomes. 

We limit ourselves to non-unital operators that result in 
translations along the Pauli axes. For each eigenstate, /, of 
a Pauli operator, we define the following two Kraus operators 
with the same classical probability: 

• Measurement-induced translations 



l/X/l = 



Notice that the effect of these two operators is to discard 
the state with a probability of and replace it by |/). The 
effect on a state, when represented on the Bloch sphere, is to 
translate it toward | /) . 

To ensure trace preservation, we set po = v/l — Tl a Pa 
where a sums over all other operators. 

Throughout the paper we will refer to four sets of these 
error process: PC, CC, PMC, and CMC. The Pauli Channel 
(PC), introduced above, is limited to Pauli errors. The Clifford 
Channel (CC) includes all efficiently simulable unitary gates 
rf25tl . The Pauli and Measurement Channel (PMC) includes all 
Pauli errors and all measurement-induced translation errors. 
Finally, the Clifford and Measurement Channel (CMC) in- 
cludes all Clifford errors and all measurement-induced trans- 
lation errors. We use these as approximation channels to the 
error channels presented below. 



B. Examples of non-Clifford error channels 

1. Amplitude damping 

The amplitude damping channel ADC, represented in 
Equation Q] is the prototypical non-unital error channel l26ll . 
The ADC describes the energy dissipation of a two-level 
quantum system. However small, it is present in any non- 
degenerate physical system. 



ADC = 



£ao = |0}<0| + VT^|1)(1| 
£Ui = V7|0)<l| 



(1) 



The rate of the energy loss to the environment, or damping, 
is given by the dimensionless parameter 7, which can take any 
real value between and 1 112711 . 

Numerous codes have have been developed specifically to 
combat ADC, but studying the effects of this error channel 
on a circuit has yielded only a handful of results Il4l28l430ll . 
All of the results assume 7 to be small in order to expand the 
Kraus operators in a Taylor series expansion using the Pauli 
operator basis. 



2. Polarization along an axis in the X-Y plane 



Another interesting error channel is a polarization along a 
non-Pauli axis. Specifically, we focus on a polarization along 
an axis in the X-Y plane of the Bloch sphere: 



PoUC 



E X yO — y/1 — P<j>I 

E xy i = JP4, [cos(</>) X + sin(0) Y] 



(2) 



where the parameter (ft represents the angle of the polarization 
axis with respect to the X axis and p^ the probability of error. 

Unlike the ADC channel, Pol^C is unital. Yet unless the an- 
gle <p is a half-integer multiple of tt, the depolarization occurs 
along a non-Pauli axis, and the quality of the PC approxima- 
tion will vary with (ft. 
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C. Evaluating the approximations 

To study how closely our error models approximate target 
error channels, we compute the distance between the process 
matrix of our error model and the process matrix of the tar- 
get error. For an error model with n operators (including the 
identity), this distance is a function of the n — 1 linearly in- 
dependent probabilities associated with the operators. As a 
distance measure we employ the normalized Hilbert-Schmidt 
distance HH] . This distance ranges from (for two identical 
channels) to 1 (for two orthogonal channels). 



D{xuX2) = i^txi-X2\\ 2 HS ( 3 ) 



\\A\\ HS = s/r^A) (4) 

Throughout this paper N = 2, since we will only focus on 
the one-qubit errors. After calculating the distance, we then 
minimize it over the n — 1 independent variables. 

As our goal is to understand for which cases this error 
model would be an appropriate approximation, we want our 
model to be an upper bound to the error induced on the sys- 
tem. Therefore, we perform the distance minimization with 
the constraint that the fidelity between the identity channel 
and our error model is not greater than the fidelity between 
the identity channel and the target error. This constraint en- 
sures that our approximation will not underestimate the real 
target error. 

F(I, Target) > F(I, Model) (5) 
The fidelity can either be an average fidelity: 

F av {V,K) = ^Y.\ Tx ^ K ^ 2 ( ® 

i 

where {Ki} are the Rraus operators of the error channel K 
and V is a unitary transformation, or a worst-case fidelity: 

F W (V, K) = min V \Ti(V^K iP )\ 2 (7) 

p e D — » 

i 

where in this case the fidelity is minimized over all the density 
matrices p. The minimization was performed with Python's 
sequential least squares programming minimization subrou- 
tine. 

We use the Hilbert-Schmidt distance for most of the anal- 
ysis here due to ease of computation, but the method works 
for any distance measure or constraint 13211 . In most cases, the 
worst case fidelity constraint would be appropriate for calcu- 
lating lower bounds on error correction thresholds. For certain 
cases, such as Pol^C, the two constraints give the same results. 
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FIG. 1. Minimum distance between two approximate error models 
and the amplitude damping channel as a function of 7, the damping 
strength. Although not shown, the results for the CC and CMC are 
exactly the same as the results for PC and PMC, respectively. The 
inset figure, a zoomed version of the same plot, gives an idea of how 
fast an error model without measurement-induced translations be- 
comes an inaccurate approximation compared to an error model that 
includes them. For small values of 7, however, both distances scale 
quadratically. 



III. RESULTS 

A. Amplitude Damping Channel (ADC) 

Figure[TJshows the results of the approximation of the ADC 
by the error models introduced in Section [II] with the average 
fidelity constraint. Each one of the 200 points corresponds to 
a numerical minimization for a particular damping strength. 
After fitting these points and then solving symbolically, for 

both the PC and the CC the distance between the ADC and 

2 

the best approximation was found to be Dp = where 7 
is the damping strength. This means that as the non-unital 
character of the ADC becomes more pronounced, the unital 
error models give less an accurate approximation. The larger 
repertoire of operators in the Clifford group does not improve 
the approximation obtained with only Pauli operators. 

On the other hand, the addition of the measurement- 
induced translations considerably improves the approxima- 
tion. In this case, the distance between the approximation 
and the ADC is given by D m = (7-i)(7+2vT^ a and me 
PMC and CMC significantly outperform the models without 
measurement for 7 > 0.05. The PMC and CMC can match 
the ADC perfectly only for 7 = 0, which corresponds to the 
trivial case, and 7=1, which corresponds to a measurement 
that is actually part of the operator repertoire of our error 
model. Interestingly, despite the large amount of operators in 
the CMC error model, the best approximation only employs 
the identity and the translation towards |0) and it is given by 

{£ = \/t — p^i, e-l = ypwr|o> <o| , e 2 = ^|o>(i|}, 

with p m = i(l + 7 — \A — 7)- ^ i s aiso noteworthy 
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that, for small 7 values, D m = 32 — gj + CK7 4 ), while 

'v 2 

Dp = although the measurement operators improve the 
approximation even for small 7 values, both methods have a 
quadratic dependence on 7. 

When the constraint is changed from the average fidelity to 
the worst fidelity, then the PC and CC approximations have 
a distance of D P>W = 27 2 -3 7 +2+27V^7-2v^7 ^ while the 
PMC and CMC have a distance of D m ^ w = 2D m . Both of 
these cases result in larger distances than the ones with the av- 
erage fidelity approximation and the difference between mod- 
els with and without measurements is even more pronounced. 

The results obtained by the average fidelity and the worst 
fidelity contraints are best illustrated in Figure [2] Here we 
examine, for 7 = 0.25, the closest PC (a) and PMC (b) ap- 
proximation assuming either one of the two constraints. The 
figure shows a cross section of the Bloch sphere and its trans- 
formation by the ADC and the closest approximate channel 
with either the average fidelity constraint (red) or the worst 
fidelity constraint (blue). Notice that for these error channels 
the deformed Bloch sphere is still symmetric with respect to 
rotations around z, so a cross section is enough to visualize 
the whole process. 

The approximation using the worst fidelity constraint guar- 
antees that the largest distance between any input and the tar- 
get channel output will be less than the largest distance be- 
tween any input and the approximate channel output. In this 
case, for both the ADC and its approximations the largest 
discrepancy between input and output occurs when the ini- 
tial state is Notice that for the PMC approximation this 
constraint also guarantees that for all inputs the approximate 
channel outputs are further from the input than for the target 
channel. This is pictorially represented in Figure 0b), where 
the blue curve is always inside the green curve or further away 
from the initial states (black curve). For the PC, however, this 
is not the case, as Figure|2ja) shows. Here the blue curve lies 
outside the green curve for some points. Indeed, if we use a 
unital channel to approximate a non-unital one, it is impossi- 
ble to satisfy the condition that for every input the approxi- 
mate channel output will be further from the input than for the 
target channel. Simply consider the maximally mixed state, 
which is mapped to itself by a unital channel, but mapped to a 
different state by a non-unital one. 



B. Polarization along an axis in the X-Y plane (Pol^C) 

Figure [3] shows the results of the approximation of the 
Pol^C by the error models introduced earlier. Once again, 
each one of the 200 points corresponds to a numerical min- 
imization. Because of the unital nature of this channel, it is 
the addition of the Clifford operators rather than the measure- 
ment operators that improve the approximation. For both the 
PC and the PMC, the distance between Pol^C and the best 
approximation was found to be Dp = jp 2 sin 2 (20). When 
the Clifford operators are included in the approximate chan- 
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FIG. 2. Cross-sectional view of the Bloch sphere and the effect 
of amplitude damping and two approximations with different con- 
straints. AFA stands for average constraint approximation, while 
WFA stands for worst constraint approximation, a) Channels without 
measurement operators, b) Channels with measurement operators. 
For both cases, 7 = 0.25. 



nel, the new distance is reduced to Dq = ^|p 2 (sin(2</>) + 
cos(2(/>) - l) 2 for < <j> < tt/4 and for p < 0.9 At 
the worst point of the CC (which in this interval occurs at 
4> = 7r/8, 37t/8), the PC is 6.8 times worse. Notice that not 
only the distance is decreased; the period of the distance func- 
tion is also reduced from f to because between every two 
Pauli axes there is a Clifford axis. 

Once again, despite the large amount of operators in the 
CMC, the best approximation uses a small number of them: 
the identity and the two axes closest to the polarization 
axis. If we only employ Pauli axes, the best approximation is 
{E = y/T=3l, E l = Jp cos(0) X , E 2 = ^sin(0)F}, 
where cf> and p are the same as in Equation (2). If we em- 
ploy the whole Clifford group, the best approximation is given 
by{£o = V 1 - Pi -P2I , Ei = y/plX , E 2 = ^/pIHxy}, 
where H XY = -^(X + Y), jn = f (3 + 4cos(20) - 

3sin(2(/))),andp2 = f (3 - 3 cos 20 + 4 sin(20)). Finally, as 
mentioned before, for this error channel there is no difference 
between the results obtained with either fidelity constraint. 
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FIG. 3. Minimum distance between several approximate error mod- 
els and the polarization along an axis in the X-Y plane of the Bloch 
sphere as a function of the polarization angle. Although not shown, 
the results for PMC and CMC are the same as the results for PC and 
CC, respectively. The distances scale quadratically with p, so the 
results are normalized by p 2 . 



C. Random Error Channels 

We have seen that the addition of the measurement-induced 
translations and the Clifford operators improves the approx- 
imation of two specific error channels. To determine how 
the method works for general errors, we generated 2000 ran- 
dom process matrices and computed the distance of the best 
approximation that each one of the 4 approximate channels 
could make. For the 1-qubit case, a process matrix is a 4 x 4 
Hermitian positive matrix M with 4 constraints: Tr(M) = 2, 
Re(Moi) = -Im(M 23 ) , Re(M 02 ) = Im(M 13 ) , and Re(M 03 ) = 
-Im(Mi2) ll34ll . To generate this matrix we first create a 4 x 4 
diagonal matrix D with real, positive diagonal entries that add 
to 2. We then create a 4 x 4 random unitary matrix U and ap- 
ply this unitary transformation to D to obtain M — UDIP, 
which is positive with trace 2. We then enforce the last 3 con- 
straints mentioned earlier and keep the random process if the 
matrix is still positive. 



TABLE II. Summary of the approximations obtained with each of 
the 4 error models. 



Channel 


Distance mean 


Distance median 


Distance variance 


PC 


0.043 


0.038 


1.0 x 10" 3 


PMC 


0.029 


0.012 


9.1 x 10~ 4 


CC 


0.015 


0.012 


1.5 x 10~ 4 


CMC 


0.0027 


0.0011 


1.6 x 10" 5 
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Figure|4]illustrates the distance between each random error 
channel and the best approximation as a function of the dis- 
tance between the error channel and the identity channel. The 
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FIG. 4. Distance between the random error channels and the best ap- 
proximations attained with each model as a function of the distance 
between the error and an errorless channel (identity). The slope of a 
line joining the origin and a point represents the distance of the best 
approximation to that error relative to the magnitude of the error. Ev- 
ery approximation includes the errorless channel and this limits the 
distance between the approximation channel and error channel to be 
below a line of slope 1 (black dotted line). 



fidelity constraint guarantees that the approximation will al- 
ways be more distant from the identity than the error is. Notice 
that as the amount of operators in the error models increases, 
both the mean and the median distance between each model 
and the random error decreases and the distributions become 
more compact, as summarized in Table ITfl 

Although the approximations with the CC had a smaller 
mean distance and a more compact distribution than the ones 
with the PMC, it is not clear that this difference is signifi- 
cant. The most important improvement occurs when we add 
both the unital Clifford gates and the non-unital measurement- 
induced translations. For the CMC, for 48% of the generated 
random process matrices the distance of the best approxima- 
tion was less than 0.001, while for the other error channels the 
fraction of approximations with a distance in this interval was 
not greater than 6%. 



IV. CONCLUSIONS 

We have presented an extension to the random Pauli error 
model which is still compatible with efficient simulation using 
the Gottesman-Knill theorem and leads to a computationally 
tractable description of realistic error models like amplitude 
damping. For ease of calculation, we have used the average 
distance as the measure to be optimized under two different fi- 
delity constraints of the error channels relative to an error free 
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channel. We have also only focused on single qubit errors in 
the absence of quantum operations. Once we consider sim- 
ulating operations over many qubits, we will need a distance 
measure that is composable over tensors. A more natural dis- 
tance measure in this regard is the diamond norm |[35ll . 

Our method can be extended to multi-qubit channels but the 
optimization becomes more difficult as the number of Clifford 
operators grows quickly with n. In many cases, symmetries 
of the underlying error channels will minimize the number of 
Clifford operators that must be considered. In future work, 
we will compare for a specific error correction circuit how 
the logical error rate compares for the models. We expect in 
the case of multiple rounds of error correction a substantial 
difference between error models with distinct fixed points. 
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